All Questions
Tagged with character-encodingcommand-line
16 questions
1vote
1answer
98views
Command similar to ascii for ascii extended and/or for unicode?
ascii command in Linux is fast and great. It allows us to search for a character or for a code point and returns all relevant results for a given search. Is there something similar for ASCII extended (...
1vote
1answer
48views
Chaotic Command-line Interface Layout
When I type a long command on a command-line interface. Something strange may happen in the layout. The characters I typed don't show in lines correctly. Instead, they merge into 1 line or overwrite ...
2votes
1answer
834views
Is there any ligature-aware alternative for "pdfgrep" in command line?
I always use "pdfgrep" to search inside of multiple PDF files from the command line. But I met a problem: This ligature character "fi" (see https://www.compart.com/en/unicode/U+FB01)...
0votes
0answers
457views
Question marks in my kali linux shell
I am trying to do active reconnaissance in Kali Linux (4.19.0-kali3-amd64) using xprobe2 tool. So I typed the command xprobe2 espn.com (for example). I know that espn.com use "OpenBSD" ...
0votes
0answers
159views
Converting text file encodings
I have been having a horrible time with text files that mostly have cp1252 encoding, but some don't. When I try to iterate over them in Python, I'll hit a file with some other weird encoding and the ...
16votes
4answers
60kviews
Convert binary encoding that head and Notepad can read to UTF-8
I have a CSV file which is in binary character set but I have to convert to UTF-8 to process in HDFS (Hadoop). I have used the below command to check characterset. file -bi filename.csv Output : ...
2votes
0answers
668views
Generating 'ASCII-art' banners with arrows
I recently discovered figlet which generates ASCII-art banners. Joy! ... but, alas, I want a banner with an arrow on it. Now... $ figlet unicode → arrow _ _ //\ ...
5votes
4answers
8kviews
Prevent tail from modifying the charset of the terminal
In a Linux terminal (CentOS) I am using the command tail --follow=name my-rolling-file.log in order to see the logs of my application. Sometimes in the log, there is some binary data dumped (I dump ...
2votes
2answers
568views
Incorrect output from cp, rm, and so on
When I run, say, cp, I get output like the following: # cp -v Foo Bar âFooâ -> âBarâ What's up with the weird â characters? Why is the shell doing this? It looks like some kind of strange encoding ...
2votes
0answers
1kviews
Character encoding issue with my linux install?
I just installed Ubuntu 12.04.4 64 bits, all seemed to be ok, when I started to work on console I noticed: that I couldn't get some accented characters typed in to the console, for example, if I tried ...
5votes
3answers
10kviews
How to specify attachment encoding in mutt command line?
I'm trying to send an email with attachment from a perl script. First, I create the attachment (an xml file): open(XMLFILE, ">:utf8", $xmlfile); print XMLFILE "<?xml version=\"1.0\" encoding=\"...
36votes
3answers
101kviews
Characters encodings supported by more, cat and less
I have a text file encoded as following according to file: ISO-8859 text, with CRLF line terminators This file contains French's text with accents. My shell is able to display accent and emacs in ...
1vote
0answers
1kviews
Convert ASCII-8BIT to UTF-8 using OSX' .bash_profile
When running certain rails scripts in OSX' Terminal, I get warnings such as the following: unable to convert "\xA9" from ASCII-8BIT to UTF-8 for tmp/cache/assets/CC7/4D0/sprockets%...
7votes
5answers
16kviews
How to convert to HTML code?
Are there any scripts that can convert between text (e.g. <hi>) and the html entities version (<hi>) like this website does? Or at least a PHP file?
4votes
2answers
19kviews
Converting text into ASCII/ISO-8859-1
I'm looking for some tool that can convert text, ideally from UTF-8 (but ISO-8859-2 and WINDOWS-1250 would be fine) into ASCII/ISO-8859-1? I have seen some online transliteration tools but I need ...